34%
12.02.2014
One goal of HPC administration is effective monitoring of clusters. In this article, we talk about writing code that measures processor and memory metrics on each node.
...
In an earlier article I discussed how to determine what metrics you might want to watch as part of cluster monitoring, as well as the frequency at which you might want to monitor them. This process ... HPC, memory, processor, monitoring, metrics, processor, memory ...
One goal of HPC administration is effective monitoring of clusters. In this article, we talk about writing code that measures processor and memory metrics on each node.
... Monitoring HPC Systems: Processor and Memory Metrics
26%
22.05.2012
The Warewulf stateless cluster tool is scalable and highly configurable and eases installation, management, and monitoring of HPC clusters.
...
A plethora of cluster tools are out there to help people get started provisioning, managing, and monitoring HPC clusters. One of the best approaches is to use stateless compute nodes, commonly ...
The Warewulf stateless cluster tool is scalable and highly configurable, and it eases the installation, management, and monitoring of HPC clusters.
24%
20.10.2013
Modern drives use S.M.A.R.T. (self-monitoring, analysis, and reporting technology) to gather information and run self-tests. Smartmontools is a Linux tool for interacting with the S ...
S.M.A.R.T. (self-monitoring, analysis, and reporting technology) is a monitoring system for storage devices that provides some information about the status of the drive as well as the ability to run ...
Modern drives use S.M.A.R.T. (self-monitoring, analysis, and reporting technology) to gather information and run self-tests. Smartmontools is a Linux tool for interacting with the S ... S.M.A.R.T., Smartmontools, and Drive Monitoring
22%
14.08.2020
Most storage devices have SMART capability, but can it help you predict failure? We look at ways to take advantage of this built-in monitoring technology with the smartctl utility from the Linux ...
S.M.A.R.T. (Self-Monitoring, Analysis, and Reporting Technology) is a monitoring system for storage devices that provides information about the status of a device and allows for the running of self ...
Most storage devices have SMART capability, but can it help you predict failure? We look at ways to take advantage of this built-in monitoring technology with the smartctl utility from the Linux
15%
20.04.2022
user.comment.name -v "Jeff Layton created this file" test.txt
The list of extended attributes for this file can be created:
$ getfattr test.txt
# file: test.txt
user.comment
user.comment.name
Now
14%
19.11.2014
local or remote, in your browser using websocketd. Although I won't go into it in depth, Web-vmstat does a pretty good job monitoring problem servers. For example, if a node has been exhibiting strange
14%
19.05.2014
with my /home/layton
directory on my local system (host = desktop
). I also access an HPC system that has its own /home/jlayton
directory (the login node is login1
). On the HPC system I only keep some
14%
21.08.2012
Listing 6: Torque Job Script
[laytonjb@test1 TEST]$ more pbs-test_001
1 #!/bin/bash
2 ###
3 ### Sample script for running MPI example for computing PI (Fortran 90 code)
4 ###
5 ### Jeff Layton
14%
01.08.2012
lib/atlas/3.8.4 modulefile
#%Module1.0#####################################################################
##
## modules lib/atlas/3.8.4
##
## modulefiles/lib/atlas/3.8.4 Written by Jeff Layton
14%
01.08.2012
by Jeff Layton
##
proc ModulesHelp { } {
global version modroot
puts stderr “”
puts stderr “The compilers/gcc/4.4.6 module enables the GNU family of”
puts stderr “compilers that came by default